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^ , Abstract: Wc show that an infinite Galton- Watson tree, conditioned on its mar- 

tingale hmit being smaUer than e, converges as e 4, in law to the regular /i-ary 
i tree, where /i is the essential minimum of the offspring distribution. This gives an 

, example of entropic repulsion where the limit has no entropy. 

! 1. Introduction and statement of the result 

The problem of conditioning principles can be formulated in the following way: Given that 
some quantity averaged over a large number of individual random variables shows highly 
unlikely behaviour, describe the conditional law of an individual sample. This situation 
arises frequently in statistical mechanics, where the random variables describe individual 
features of particles (for example their velocity) and the ensemble of particles is subject to 
some constraint (for example a fixed energy per particle). The distribution of the individual 
1/-^ I feature given the constraint is then referred to as the micro- canonical distribution of the 

system. The most famous result in this respect is the Gibbs conditioning principle, which 
loosely speaking says that under the condition that the empirical measure 
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of a family of independent random variables Xi, , . . . , X„ with law P belongs to some set A, 
the law of Xi converges to the probability measure Q that minimizes the relative entropy 
^ ■ H{Q \P) subject to the constraint Q € A. There exist several refinements of this result 

^ . describing rigorously the precise asymptotic strategy by which the random variables realize 

the large deviation event {L„ € A}. See the book of Dembo and Zeitouni [10] for more on 
the classical Gibbs conditioning principle, [H [TT| [T7] for refinements, and \12\ [T^ [T5] for 
further examples of conditioning principles. 

The conditioning principle of the present paper deals with Galton-Watson trees with a 
nondegenerate offspring variable N satisfying P{N = 0) = and EN log N < 00. Let 
a := EN be the mean offspring number. We denote by (Zn- n = 0, 1, . . .) the sequence 
of generation sizes of the Galton Watson tree and note that by definition Zq = 1. By the 
Kesten-Stigum theorem the martingale limit 

W := lim — 

n— >oo a"' 

is well-defined and strictly positive almost surely. Note that W can be seen as a random 
constant factor in front of a deterministic exponential growth term a^, which together de- 
termine the asymptotics of the generation size Zn- We are interested in the limit behaviour 
of Zi, or more generally of the entire tree, when we condition on the large deviation event 
that the martingale limit W is smaller than some e ^0. 
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For the formulation of the result we denote by T the space of all rooted trees with the 
property that every vertex has finite degree. A metric d on this space is uniquely determined 
by the requirement that d{Ti,T2) = e""", when n is maximal with the property that the 
trees Ti and T2 coincide up to the ?ith generation. This makes (T, d) a complete, separable 
metric space. 

Theorem 1. Suppose N is a random variable on the positive integers satisfying the condition 
ENlogN < 00, and denote 

H := min{n € N: F{N = n) > O} > 1. 

Suppose that T is a Galton-Watson tree with offspring variable N and that W is the as- 
sociated martingale limit. Then, as e J, 0, conditionally on the event {W < e} the tree T 
converges in law on (7~, d) to the regular fi-ary tree. Equivalently, for all /c S N, 

limP(Zfc = /'|VF<e) =1, 
where denotes the size of the kth generation. 

From the point of view of large deviations theory this result is quite surprising, at least at a 
first glance. One would expect that the limiting behaviour represents the optimal strategy 
by which the event W = Q \s realized and that this strategy depends on the details of the 
law of A^. Moreover, there seems to be no good reason why in the limit the growth rate of 
the tree should drop dramatically, or in fact why it should drop at all, as we only require 
the constant to be small. Above all, the probability of seeing a /i-ary tree up to the nth 
generation may be arbitrarily small and can certainly be much smaller than those of other 
trees with Z„ < ea". 

This becomes even more intriguing if the result is put in the context of entropic repulsion. 
This is an expression used by physicists to convey the idea that entropy maximisation may 
force certain systems to obey properties that are not obviously imposed on them a priori. 
This phenomenon has been studied mathematically in the context of the two-dimensional 
harmonic crystal with hard wall repulsion by Bolthausen et al , where the following result 
was proved. Consider the discrete Gaussian free field {^Px)x£Dn on a planar domain with 
mesh size 1/n. If the field is conditioned to be nonnegative everywhere, then the typical 
value of the field (fx at any point x in the interior of the domain will be highly concentrated 
near the value (4/7r) log n with overwhelming probability as n — )• 00, and in particular under 
this conditioning the value of ipx diverges to infinity. An analogous phenomenon is studied by 
Benjamini and Berestycki [2] and [3], where it is shown that conditioning a one-dimensional 
Brownian motion on some self-repelling behaviour may force the process to satisfy a strongly 
amplified version of the constraint. Usually, the reason entropic repulsion may arise is in 
order to increase the entropy of the system, i.e., make room for fiuctuations. Thus the 
eventual state of the system is a compromise between the energy cost of adopting an unusual 
behaviour and the entropic benefits. Theorem [1] may also be cast in this framework, as 
it shows that the effect of requiring the constant W to be small is to change the overall 
exponential growth rate from a to However, if the limiting state of the system is the 
regular /x-ary tree, which is non-random, what could the entropic benefits possibly be? 
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The resolution of this apparent paradox comes from understanding the inhomogeneity of the 
optimal strategy, and can be explained by a closer look at the formula 

While the growth rate log a is purely asymptotic, i.e. depends only on the offspring numbers 
after any given generation, the growth constant W depends heavily on the initial generations 
of the tree. It turns out that, roughly speaking, the collection of trees which form the 
optimal strategy to achieve W < e have minimal offspring for a few generations, the exact 
number depending on £, and causes high entropic and energetic cost but only for a small 
number of generations, and then switch to growth with the natural rate logo. The initial 
behaviour ensures that W is small at a mimimal probabilistic cost, because for all but a finite 
number of generations the trees can have their natural growth. The topology on T compares 
trees starting from their root so that in the limit we only see the behaviour in the initial 
generations. This leads to a limiting object with minimal growth rate at all generations and 
creates the illusion of a drop in the growth rate for the optimal strategy. A somewhat similar 
phenomenon is observed by Bansaye and Berestycki |T] in the context of branching processes 
in random environment, although they consider situations where the growth rate is directly 
conditioned to be atypical. 

Our interest in this 'paradox' does not come from the study of trees alone. Indeed, Morters 
and Ortgiese [16| describe a range of problems, mostly related to local times of Brownian 
motion, which have a similar intrinsic structure and could therefore also satisfy loosely 
analogous conditioning principles. However, we shall defer the discussion of such problems 
to a different place as, unlike in these problems, the main mathematical difficulty here is 
related to the discrete nature of the distribution N. 

2. Proof of Theorem 1 

Denote pk := P{N = k) and recall that by our definitions Po = • • • = P/^-i = 0, p^i > 0. The 
basic idea of the proof is is to combine tail asymptotics at zero for the random variable W 
with the self-similarity property of Galton-Watson trees, which states that, for every n E N, 

-| Zn 

i=l 

where Wi, i = 1,2,... are independent variables with the same law as W, independent of Z„. 
This follows easily from the decomposition of the tree according to the ancestry in the nth 
generation. 

We first give the proof in the case /i = 1, which is very simple as in this case the /x-ary tree 
is degenerated and has no exponential growth. In this case the tail at zero of the random 
variable W is fat, more precisely there exist constants < c < C such that 

ce^ <F{W <e) <Ce^ for ah < e < 1, 

where r := — logpi/loga, see [161 Theorem 1(a)] for a simple proof. Using ([T]) we infer 

P(Z„ >l,W <e) < P{Wi + W2< a"e) < P{W < a^'ef 

and hence 

F{Zrr>l\W <e) < (^a^--) ^ 0, 
as required to complete the proof in the case fi = 1. 
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Now we consider the case fi > 1 and assume that 7^ 1 to avoid triviahties. We define the 
Bottcher constant /3 G (0, 1) by 

A function V : (0, 00) — ?• (0, 00) is called multiplicatively periodic with period A 7^ 1 if V{Xx) = 
V{x) for all X > 0. Biggins and Bingham [H Theorem 3] show that there exists a real-analytic 
multiplicatively periodic function M: (0, 00) (0, 00) with period a^~^ = a/ fi > 1 such that 

-log¥{W < x) = M{x)x~^ +o{x~^) asxIO, (2) 

see also Fleischmann and Wachtel [13] for refinements of this statement. A key argument in 
the proof of ([2]) is to relate the left tail of a positive random variable to the behaviour of its 
Laplace transform at infinity in a way reminiscent of the Tauberian theorem of de Bruijn, 
see [H Theorem 4.12.9]. In the next lemma we generalise this result to the case of several 
independent copies of W using the same basic method of proof as in [^. 

Lemma 2. Let X be a positive random variable such that, for some a > 1 and /3 G (0,1), 
for all s > 0, 

lim l°gJg^^Pi-^«"^} =, k{s) 

with some real-analytic function k on (0,oo). Then there exists a real-analytic multiplica- 
tively periodic function V: {0,oo) (0, 00) with period a^~^ such that, for any m € N and 
Xi, . . . ,Xm independent with the same distribution as X, we have 

-logP(Xi H h < x) = mV{x/in){x/m)~~^ + o{x~^) as x i 0. 

Proof. Let Yn be real-valued random variables and denote, for s > 0, 

kn{s) := logEexp{sy„} e (—00,00]. 

Assume that for some sequence of positive numbers bn t 00, we have 

lim — =: k[s) € (—00,00]. 

n^oo bn 

The Fenchel dual of k is given by k*{x) := supg^gjxs — k{s)} G (—00,00]. By a variant of 
the Gartner-Ellis theorem, see [U Corollary 1], we have 

lim — — — ^ " ~ ^"^^ = k*{y) for all y G (limfc'(s), lim k'{s)). 

n->oo bn ^iO sfoo 

We apply this first to the sequence Yn := —a^-X and observe that k = k in this case. Note 
that k satisfies, by definition, k{s) = v{s)s^ for a multiplicatively periodic function v with 
period a. Using that k is strictly convex and decreasing we get lim^io k'{s) =: —5 < 0, and 

> lim k'{s) = lim fc'(a"+i) > lim ^^""^'j ~ ^^""^ = lim a^^"-'^ v{l) "^—^ = 0. 

stoo n-i-oo n^oo 0"+ — n— >cxd a — 1 

Therefore, 

n— ^00 O ^ 

Setting X = —y and rearranging, 

}To. an^x-mi-^) >=k*{-x)xT^=:V{x), for xG (0,5), 

where V is real-analytic and multiplicatively periodic with period a^~^ . 
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Now, consider Yn = + • • • + Xm)- We have 

^. logEexp{-sa"(Xi + --- + X^)} logEexp{-sa'^X} 

lim -s = m iim 3 = mkis), 

and hence 

1- -logn-«"(^i + --- + ^m)>a"^y) ; , . , . 

hm -3 = sup |ys — = mk [y/m) 

for all y G {—m6, 0). Setting x = —y and rearranging we obtain, for all x € (0, m5), 

r -logn^i + ---+^m<fl-"(^-^)x) / , A T/r / ^ rS 

lim 7777- — = mk (—x/m) x^-i^ = V (x/m) m^-i^ . 

Denote by Hn{x) the fraction on the left hand side, and by H{x) the right hand side of the 
display above. Further, denote 

Hn{x) := Hn{x) X 1-/5 and H{x) := H{x)x ^-P . 

Let / = [a~'^^^~^^m5,a~^^~^^mS\. Then Hn converges to H pointwise on /, H is continuous 
on /, and each Hn is decreasing. Hence Hn converges to H uniformly on /, see e.g. [5], and 
therefore converges uniformly to H on /. By the periodicity of V we have 

sup I X 1-/3 logP(XiH h < x) + y(x/m) mi-'5 1 = sup |i7„,(x) - if(x)|, 

and hence 

sup |xi-/3 logP(XiH V Xm < x) + V{x/m)m^-ii\ 

a;<a-^(l-/5)m<5 

= sup sup \ Hn{x) — H{x) I , 

n>N xel 

which converges, as N ^ 00, to zero as required. □ 

The next lemma states a basic property of real analytic, multiplicatively periodic functions. 

Lemma 3. Let V: (0, 00) (0,oo) be a real- analytic, multiplicatively periodic function and 
let J > 0. Suppose that B be a dense subset of [1, 00) such that 

liminf {V{e/b) h^' - V[e)) > for all beB. 

Then, for any b^ > 1, 

inf {V{e/b) 6^ - F(e)) > 0. 



E>0 

b>b. 







Proof. Define a real-analytic function /: M — t- M by f{y) = logy(e ^). Then 
V{e/b)b'^' - V{e) = exp{/(- loge + log6)}6^ - exp{/(- loge)}, 
and, substituting x = — loge and 5 = log 6, we obtain that it suffices to show that 

liminf (e-^(^+^)+^'' - e-^^^^) > for ah 5 G D := {log 6: b G B} (3) 

x^oo ' 

=^ inf (e^(^+'5)+T^ - e^(^)) > for any 6o > 0. (4) 

S>So 

By periodicity of /, the statement of ([3]) is equivalent to f{x + 5) + j5 > f{x) for all x G M 
and 5 ^ D. As i? is dense in [1, oo), D is dense in [0, oo) and, using the continuity of /, 

/(x + 5) + 7(5 > /(x) for aU x G M and 5 > 0. (5) 
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Suppose that is not true and the infimum is equal to zero. Since / is periodic and 



hm inf {e^("+^)+^^-e^(")| 



oo, 



the infimum in Q is attained at some point (x, 6). As this infimum is zero, we infer that 

f{x + 6) + j6 = fix). (6) 

Let T] € [0, 6]. Using ([5|) for the points x, rj and x + r], 6 — r], we obtain f{x + rj) + ^rj > f{x) 
and f{x + 6) + 7((5 — r?) > f{x + r/). The second inequahty together with Q imphes 
fix) > f{x + rf) + jT], which together with the first inequahty gives f{x + ij) = f{x) — jr] 
for all r] € [0,6]. Hence / is linear and non-zero on [x,x + 5]. As it is real-analytic it must 
be linear on M, contradicting the periodicity of /. □ 



We now return to the study of the martingale limit W. A result from [5] states that 

logEexp{-sa"Ty} , , , 
hm 3 = k[s) 

for some real-analytic function k on (0, oo). Using Lemma[2]we infer from this that, for some 
real-analytic and multiplicatively periodic M : (0, oo) — )> (0,oo) with period a^~^ we have, 
for any m € N, 

-logP(VFiH bWm<e) =mM{e/m){e/m)~^ +o{e~^) as e i 0. (7) 

Using first ([1]) , then ^ , and finally and periodicity of M, we get 



logP(iy <£\Zn = m) = logP^^Wi < ea" 



4 = 1 



/ \ J- pi __P_ P . 

= -M{£a^/m) a ^~^e i-" + o(e ^-p) 

= -M{efi'^/m) {m/i^iJ')^ + o{e'^). 

Combining with ([7]) again we obtain 

logP(W^ < e) - logP(W^ <e\Zn = m) 

/ 1 \ 13 13 (8) 

= (M{en''/m){m/fi'')^ - M(e)J + o(e~^). 

Lemma [3] enables us to analyse the bracketed term. 
Lemma 4. For any bQ > 1, we have 

inf {M{e/b)b^ - M{e)) > 0. 

Proof. Since M is real-analytic and multiplicatively periodic, it suffices to check that it 
satisfies the assumptions of Lemma [3] with 7 = 1/(1 — For fixed n S N we define 

Bn := {m/fi" : P(Z„ = m) / O} 

and B = UnenBn- We now show that B is dense in [l,oo). Indeed, as 7^ 1 there exists 
u > fi with pi, ^ 0. Denote d = v — fi. We can prove by induction that 

An := {m//i": fi"^ < m < i^"" ,m = ^"(mod d)} C Bn- 

This is obvious for n = 1. Assuming that P(Z„_i = r) 7^ for all r such that /^"""^ < 
r < r = ^"~"^(mod d) we obtain that P(Z„ = m) ^ for all m such that there is r 

satisfying the conditions above and such that fir < m < vr, m = /i"(mod d). It is easy to 
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see that this is equivalent to the condition jjP' < m < ,m = /i'^(mod d). Hence An C 
and since U^eN^n is dense in [1, oo) we obtain that UneMBn is dense in [1, oo). 

Let b ^ B, that is, h = for some m and n. We have 

P(Z„ = m\W <e) ¥{W <£)= < e, Z„ = m) = r{W <e\Zn = m) P(Z„ = m) 
and so 

eio F{W <e\Zn = m) eio F{Zn = m\W <e) - ^ ' 



Hence 



hni^inf { log r{W < e) - log r{W < e | Z„ = m) } > -oo. (9) 



Combining ([8]) with ([9]) we obtain 

liminf {M[e/b)h^ - M{e)] = liminf (M[e^L''/m){m/i/'-)^ - M{e)] > 0, 
as required. □ 



We now complete the proof of Theorem 1. Fix n € N and use Lemma H] to find c > 
(depending on n) such that M{e/b)b~ - M(e) > 2c for ah e > and 6 > 1 + For all 
m > /i" + 1 we have m/fj,'^ > 1 + and hence ([8]) implies 

logP(V^ < e) -logP(V^ <e\Zn = m)> ce~^ . 

Therefore 

oo ^ ^ 

< ^ exp|-ce"i^|p(Z„ = m) < exp|-ce"~| ^- 

m=fi"+l 

as e J, 0, completing the proof of Theorem 1 in the case fi > 1. 

Remark 1. Lemma H] can be seen as an illustration of the near-constancy phenomenon 
(see [4] and references therein), which consists in the fact that the function M does not vary 
too much. Some numerical studies show that the variation of M can be very small, and close 
theoretical bounds for M are obtained for the case of an infinitely divisible distribution. No 
theoretical framework yet exists to describe the near-constancy of M in the general case. 
Lemma m implies that 

M{x){y/x)^ - M{y) > for aU < x < y, 

1 

and so, with g{x) = x ^-^ , the function x i— )• M{x)g{x) is decreasing. As the fluctuations of 
M do not destroy the monotonicity of the decreasing function g they cannot be too large. 
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